Overview

Dataset statistics

Number of variables: 16
Number of observations: 4600
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 575.1 KiB
Average record size in memory: 128.0 B
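
The memory figures above can be cross-checked with simple arithmetic. The sketch below only reuses numbers from this block. Reading the result as roughly 8 bytes per value in each column is an inference, not something the report states.

```python
# Figures copied from the "Dataset statistics" block of this report.
total_size_kib = 575.1      # total size in memory, KiB
n_observations = 4600
n_variables = 16

# 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

# Assumption: every column uses the same number of bytes per value.
bytes_per_value = avg_record_bytes / n_variables

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"bytes per value:     {bytes_per_value:.2f}")
```

The small excess over exactly 128 B per record is plausibly index overhead, but the report does not break that down.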

Variable types

DateTime: 1
Numeric: 12
Categorical: 3

Alerts

Age is highly correlated overall with bathrooms and 2 other fields [High correlation]
Total_sqft is highly correlated overall with bathrooms and 5 other fields [High correlation]
bathrooms is highly correlated overall with Age and 6 other fields [High correlation]
bedrooms is highly correlated overall with Total_sqft and 3 other fields [High correlation]
floors is highly correlated overall with Age and 3 other fields [High correlation]
price is highly correlated overall with Total_sqft and 2 other fields [High correlation]
sqft_above is highly correlated overall with Total_sqft and 5 other fields [High correlation]
sqft_basement is highly correlated overall with Total_sqft [High correlation]
sqft_living is highly correlated overall with Total_sqft and 4 other fields [High correlation]
yr_built is highly correlated overall with Age and 2 other fields [High correlation]
waterfront is highly imbalanced (93.9%) [Imbalance]
view is highly imbalanced (71.9%) [Imbalance]
price is highly skewed (γ1 = 24.79093256) [Skewed]
price has 49 (1.1%) zeros [Zeros]
sqft_basement has 2745 (59.7%) zeros [Zeros]
yr_renovated has 2735 (59.5%) zeros [Zeros]
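
The two imbalance percentages can be reproduced from the category counts listed later in this report. The sketch below assumes the score is one minus the normalised Shannon entropy of the class proportions. The report does not state its formula, and `imbalance_score` is a hypothetical helper name, so treat this as a consistency check rather than the tool's implementation.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of
    the class proportions and k is the number of classes. 0 means perfectly
    balanced; values near 1 mean a single class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Counts copied from the waterfront and view frequency tables in this report.
waterfront = imbalance_score([4567, 33])
view = imbalance_score([4140, 205, 116, 70, 69])

print(f"waterfront: {waterfront:.1%}")
print(f"view:       {view:.1%}")
```

Under this assumption both values land within rounding of the reported 93.9% and 71.9%.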

Reproduction

Analysis started: 2024-11-28 09:55:17.051965
Analysis finished: 2024-11-28 09:55:45.354192
Duration: 28.3 seconds
Software version: ydata-profiling v4.12.0
Download configuration: config.json

Variables

date
Date

Distinct: 70
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 36.1 KiB
Minimum: 2014-05-02 00:00:00
Maximum: 2014-07-10 00:00:00
Histogram with fixed size bins (bins=50)

price
Real number (ℝ)

Alerts: High correlation, Skewed, Zeros

Distinct: 1741
Distinct (%): 37.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 551962.99
Minimum: 0
Maximum: 26590000
Zeros: 49
Zeros (%): 1.1%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 200000
Q1: 322875
Median: 460943.46
Q3: 654962.5
95th percentile: 1184050
Maximum: 26590000
Range: 26590000
Interquartile range (IQR): 332087.5

Descriptive statistics

Standard deviation: 563834.7
Coefficient of variation (CV): 1.0215082
Kurtosis: 1044.3522
Mean: 551962.99
Median Absolute Deviation (MAD): 157500
Skewness: 24.790933
Sum: 2.5390297 × 10^9
Variance: 3.1790957 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0 | 49 | 1.1%
300000 | 42 | 0.9%
400000 | 31 | 0.7%
600000 | 29 | 0.6%
450000 | 29 | 0.6%
440000 | 29 | 0.6%
350000 | 28 | 0.6%
250000 | 27 | 0.6%
550000 | 27 | 0.6%
415000 | 27 | 0.6%
Other values (1731) | 4282 | 93.1%

Smallest values
Value | Count | Frequency (%)
0 | 49 | 1.1%
7800 | 1 | < 0.1%
80000 | 1 | < 0.1%
83000 | 1 | < 0.1%
83300 | 2 | < 0.1%
84350 | 1 | < 0.1%
87500 | 1 | < 0.1%
90000 | 2 | < 0.1%
100000 | 4 | 0.1%
102500 | 1 | < 0.1%

Largest values
Value | Count | Frequency (%)
26590000 | 1 | < 0.1%
12899000 | 1 | < 0.1%
7062500 | 1 | < 0.1%
4668000 | 1 | < 0.1%
4489000 | 1 | < 0.1%
3800000 | 1 | < 0.1%
3710000 | 1 | < 0.1%
3200000 | 1 | < 0.1%
3100000 | 1 | < 0.1%
3000000 | 1 | < 0.1%
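
Several of the price figures are derived from others in the same section. As a minimal sketch using only numbers copied from this report, the snippet below recomputes the coefficient of variation, the interquartile range, the range and the variance. The relationships are the standard definitions, and small differences come from the report's rounding.

```python
# Summary figures copied from the price section of this report.
mean = 551962.99
std = 563834.7
q1, q3 = 322875, 654962.5
minimum, maximum = 0, 26590000

cv = std / mean                    # coefficient of variation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range
variance = std ** 2                # variance is the squared standard deviation

print(f"CV       = {cv:.7f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"Variance = {variance:.7e}")
```

A CV above 1 together with a maximum roughly 40 times the 95th percentile is what drives the Skewed alert for this variable.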

bedrooms
Real number (ℝ)

Alerts: High correlation

Distinct: 10
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.4008696
Minimum: 0
Maximum: 9
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 2
Q1: 3
Median: 3
Q3: 4
95th percentile: 5
Maximum: 9
Range: 9
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.90884812
Coefficient of variation (CV): 0.26723992
Kurtosis: 1.2353774
Mean: 3.4008696
Median Absolute Deviation (MAD): 1
Skewness: 0.45644663
Sum: 15644
Variance: 0.8260049
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%)
3 | 2032 | 44.2%
4 | 1531 | 33.3%
2 | 566 | 12.3%
5 | 353 | 7.7%
6 | 61 | 1.3%
1 | 38 | 0.8%
7 | 14 | 0.3%
8 | 2 | < 0.1%
0 | 2 | < 0.1%
9 | 1 | < 0.1%

Smallest values
Value | Count | Frequency (%)
0 | 2 | < 0.1%
1 | 38 | 0.8%
2 | 566 | 12.3%
3 | 2032 | 44.2%
4 | 1531 | 33.3%
5 | 353 | 7.7%
6 | 61 | 1.3%
7 | 14 | 0.3%
8 | 2 | < 0.1%
9 | 1 | < 0.1%

Largest values
Value | Count | Frequency (%)
9 | 1 | < 0.1%
8 | 2 | < 0.1%
7 | 14 | 0.3%
6 | 61 | 1.3%
5 | 353 | 7.7%
4 | 1531 | 33.3%
3 | 2032 | 44.2%
2 | 566 | 12.3%
1 | 38 | 0.8%
0 | 2 | < 0.1%
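
Because bedrooms has only ten distinct values, its frequency table is complete and the summary statistics can be rebuilt from it. The sketch below recomputes the row count, sum, mean and variance from the counts listed in this report. Dividing by n - 1 (sample variance) is an assumption about the report's convention, chosen because it reproduces the listed figure.

```python
# Value -> count, copied from the bedrooms frequency table in this report.
bedroom_counts = {0: 2, 1: 38, 2: 566, 3: 2032, 4: 1531,
                  5: 353, 6: 61, 7: 14, 8: 2, 9: 1}

n = sum(bedroom_counts.values())
total = sum(value * count for value, count in bedroom_counts.items())
mean = total / n

# Sample variance: weighted squared deviations divided by n - 1 (assumed convention).
squared_dev = sum(count * (value - mean) ** 2
                  for value, count in bedroom_counts.items())
variance = squared_dev / (n - 1)

print(n, total)
print(f"mean = {mean:.7f}, variance = {variance:.7f}")
```

The same approach does not work for variables whose table ends in an "Other values" row, since the individual values behind that row are not listed.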

bathrooms
Real number (ℝ)

Alerts: High correlation

Distinct: 26
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.1608152
Minimum: 0
Maximum: 8
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 1.75
Median: 2.25
Q3: 2.5
95th percentile: 3.5
Maximum: 8
Range: 8
Interquartile range (IQR): 0.75

Descriptive statistics

Standard deviation: 0.78378107
Coefficient of variation (CV): 0.36272471
Kurtosis: 1.8659047
Mean: 2.1608152
Median Absolute Deviation (MAD): 0.5
Skewness: 0.61603272
Sum: 9939.75
Variance: 0.61431277
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=26)
Common values
Value | Count | Frequency (%)
2.5 | 1189 | 25.8%
1 | 743 | 16.2%
1.75 | 629 | 13.7%
2 | 427 | 9.3%
2.25 | 419 | 9.1%
1.5 | 291 | 6.3%
2.75 | 276 | 6.0%
3 | 167 | 3.6%
3.5 | 162 | 3.5%
3.25 | 136 | 3.0%
Other values (16) | 161 | 3.5%

Smallest values
Value | Count | Frequency (%)
0 | 2 | < 0.1%
0.75 | 17 | 0.4%
1 | 743 | 16.2%
1.25 | 3 | 0.1%
1.5 | 291 | 6.3%
1.75 | 629 | 13.7%
2 | 427 | 9.3%
2.25 | 419 | 9.1%
2.5 | 1189 | 25.8%
2.75 | 276 | 6.0%

Largest values
Value | Count | Frequency (%)
8 | 1 | < 0.1%
6.75 | 1 | < 0.1%
6.5 | 1 | < 0.1%
6.25 | 2 | < 0.1%
5.75 | 1 | < 0.1%
5.5 | 4 | 0.1%
5.25 | 4 | 0.1%
5 | 6 | 0.1%
4.75 | 7 | 0.2%
4.5 | 29 | 0.6%

sqft_living
Real number (ℝ)

Alerts: High correlation

Distinct: 566
Distinct (%): 12.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2139.347
Minimum: 370
Maximum: 13540
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 370
5th percentile: 950
Q1: 1460
Median: 1980
Q3: 2620
95th percentile: 3870
Maximum: 13540
Range: 13170
Interquartile range (IQR): 1160

Descriptive statistics

Standard deviation: 963.20692
Coefficient of variation (CV): 0.45023408
Kurtosis: 8.2916826
Mean: 2139.347
Median Absolute Deviation (MAD): 570
Skewness: 1.7235133
Sum: 9840996
Variance: 927767.56
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1940 | 32 | 0.7%
1720 | 32 | 0.7%
1840 | 31 | 0.7%
1660 | 31 | 0.7%
2000 | 30 | 0.7%
1410 | 29 | 0.6%
1200 | 28 | 0.6%
1480 | 28 | 0.6%
1890 | 27 | 0.6%
1490 | 27 | 0.6%
Other values (556) | 4305 | 93.6%

Smallest values
Value | Count | Frequency (%)
370 | 1 | < 0.1%
380 | 1 | < 0.1%
420 | 1 | < 0.1%
430 | 1 | < 0.1%
490 | 1 | < 0.1%
520 | 1 | < 0.1%
550 | 1 | < 0.1%
560 | 1 | < 0.1%
580 | 1 | < 0.1%
590 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
13540 | 1 | < 0.1%
10040 | 1 | < 0.1%
9640 | 1 | < 0.1%
8670 | 1 | < 0.1%
8020 | 1 | < 0.1%
7320 | 1 | < 0.1%
7270 | 1 | < 0.1%
7050 | 1 | < 0.1%
6980 | 1 | < 0.1%
6900 | 1 | < 0.1%

sqft_lot
Real number (ℝ)

Distinct: 3113
Distinct (%): 67.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14852.516
Minimum: 638
Maximum: 1074218
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 638
5th percentile: 1690.8
Q1: 5000.75
Median: 7683
Q3: 11001.25
95th percentile: 43560
Maximum: 1074218
Range: 1073580
Interquartile range (IQR): 6000.5

Descriptive statistics

Standard deviation: 35884.436
Coefficient of variation (CV): 2.416051
Kurtosis: 219.87299
Mean: 14852.516
Median Absolute Deviation (MAD): 2772
Skewness: 11.307139
Sum: 68321574
Variance: 1.2876928 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
5000 | 80 | 1.7%
6000 | 65 | 1.4%
4000 | 54 | 1.2%
7200 | 50 | 1.1%
4800 | 29 | 0.6%
4500 | 25 | 0.5%
9600 | 25 | 0.5%
5500 | 23 | 0.5%
3000 | 23 | 0.5%
7500 | 23 | 0.5%
Other values (3103) | 4203 | 91.4%

Smallest values
Value | Count | Frequency (%)
638 | 1 | < 0.1%
681 | 1 | < 0.1%
704 | 1 | < 0.1%
746 | 1 | < 0.1%
747 | 1 | < 0.1%
750 | 1 | < 0.1%
779 | 1 | < 0.1%
833 | 1 | < 0.1%
835 | 1 | < 0.1%
844 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
1074218 | 1 | < 0.1%
641203 | 1 | < 0.1%
478288 | 1 | < 0.1%
435600 | 2 | < 0.1%
423838 | 1 | < 0.1%
389126 | 1 | < 0.1%
327135 | 1 | < 0.1%
307752 | 1 | < 0.1%
306848 | 1 | < 0.1%
284011 | 1 | < 0.1%

floors
Real number (ℝ)

Alerts: High correlation

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.5120652
Minimum: 1
Maximum: 3.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 1.5
Q3: 2
95th percentile: 2
Maximum: 3.5
Range: 2.5
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.53828838
Coefficient of variation (CV): 0.35599548
Kurtosis: -0.53885198
Mean: 1.5120652
Median Absolute Deviation (MAD): 0.5
Skewness: 0.55144065
Sum: 6955.5
Variance: 0.28975438
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values
Value | Count | Frequency (%)
1 | 2174 | 47.3%
2 | 1811 | 39.4%
1.5 | 444 | 9.7%
3 | 128 | 2.8%
2.5 | 41 | 0.9%
3.5 | 2 | < 0.1%

Smallest values
Value | Count | Frequency (%)
1 | 2174 | 47.3%
1.5 | 444 | 9.7%
2 | 1811 | 39.4%
2.5 | 41 | 0.9%
3 | 128 | 2.8%
3.5 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
3.5 | 2 | < 0.1%
3 | 128 | 2.8%
2.5 | 41 | 0.9%
2 | 1811 | 39.4%
1.5 | 444 | 9.7%
1 | 2174 | 47.3%

waterfront
Categorical

Alerts: Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 36.1 KiB

Category | Count
0 | 4567
1 | 33

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4600
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4600 | 100.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

Most occurring scripts

Value | Count | Frequency (%)
Common | 4600 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4600 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 4567 | 99.3%
1 | 33 | 0.7%

view
Categorical

Alerts: Imbalance

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 36.1 KiB

Category | Count
0 | 4140
2 | 205
3 | 116
4 | 70
1 | 69

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4600
Distinct characters: 5
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 4
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

Most occurring characters

Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4600 | 100.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

Most occurring scripts

Value | Count | Frequency (%)
Common | 4600 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4600 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 4140 | 90.0%
2 | 205 | 4.5%
3 | 116 | 2.5%
4 | 70 | 1.5%
1 | 69 | 1.5%

condition
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 36.1 KiB

Category | Count
3 | 2875
4 | 1252
5 | 435
2 | 32
1 | 6

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4600
Distinct characters: 5
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 5
3rd row: 4
4th row: 4
5th row: 4

Common Values

Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

Most occurring characters

Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4600 | 100.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

Most occurring scripts

Value | Count | Frequency (%)
Common | 4600 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4600 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 2875 | 62.5%
4 | 1252 | 27.2%
5 | 435 | 9.5%
2 | 32 | 0.7%
1 | 6 | 0.1%

sqft_above
Real number (ℝ)

Alerts: High correlation

Distinct: 511
Distinct (%): 11.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1827.2654
Minimum: 370
Maximum: 9410
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 370
5th percentile: 860
Q1: 1190
Median: 1590
Q3: 2300
95th percentile: 3440
Maximum: 9410
Range: 9040
Interquartile range (IQR): 1110

Descriptive statistics

Standard deviation: 862.16898
Coefficient of variation (CV): 0.47183565
Kurtosis: 4.0701383
Mean: 1827.2654
Median Absolute Deviation (MAD): 490
Skewness: 1.4942107
Sum: 8405421
Variance: 743335.34
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1010 | 47 | 1.0%
1200 | 47 | 1.0%
1300 | 45 | 1.0%
1140 | 44 | 1.0%
1320 | 43 | 0.9%
1150 | 42 | 0.9%
1180 | 40 | 0.9%
1090 | 40 | 0.9%
1400 | 38 | 0.8%
1050 | 37 | 0.8%
Other values (501) | 4177 | 90.8%

Smallest values
Value | Count | Frequency (%)
370 | 1 | < 0.1%
380 | 1 | < 0.1%
420 | 1 | < 0.1%
430 | 1 | < 0.1%
490 | 1 | < 0.1%
520 | 1 | < 0.1%
550 | 3 | 0.1%
560 | 1 | < 0.1%
580 | 1 | < 0.1%
590 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
9410 | 1 | < 0.1%
8020 | 1 | < 0.1%
7680 | 1 | < 0.1%
7320 | 1 | < 0.1%
6640 | 1 | < 0.1%
6430 | 1 | < 0.1%
6420 | 1 | < 0.1%
6120 | 1 | < 0.1%
6070 | 1 | < 0.1%
6050 | 1 | < 0.1%

sqft_basement
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 207
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 312.08152
Minimum: 0
Maximum: 4820
Zeros: 2745
Zeros (%): 59.7%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 610
95th percentile: 1210
Maximum: 4820
Range: 4820
Interquartile range (IQR): 610

Descriptive statistics

Standard deviation: 464.13723
Coefficient of variation (CV): 1.4872307
Kurtosis: 4.08238
Mean: 312.08152
Median Absolute Deviation (MAD): 0
Skewness: 1.6427322
Sum: 1435575
Variance: 215423.37
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0 | 2745 | 59.7%
500 | 53 | 1.2%
600 | 45 | 1.0%
800 | 43 | 0.9%
900 | 41 | 0.9%
700 | 38 | 0.8%
1000 | 33 | 0.7%
400 | 33 | 0.7%
550 | 27 | 0.6%
750 | 26 | 0.6%
Other values (197) | 1516 | 33.0%

Smallest values
Value | Count | Frequency (%)
0 | 2745 | 59.7%
20 | 1 | < 0.1%
50 | 1 | < 0.1%
60 | 2 | < 0.1%
65 | 1 | < 0.1%
70 | 1 | < 0.1%
80 | 3 | 0.1%
90 | 2 | < 0.1%
100 | 14 | 0.3%
110 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
4820 | 1 | < 0.1%
4130 | 1 | < 0.1%
2850 | 1 | < 0.1%
2730 | 1 | < 0.1%
2550 | 2 | < 0.1%
2360 | 1 | < 0.1%
2330 | 1 | < 0.1%
2300 | 1 | < 0.1%
2200 | 1 | < 0.1%
2180 | 1 | < 0.1%

yr_built
Real number (ℝ)

Alerts: High correlation

Distinct: 115
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1970.7863
Minimum: 1900
Maximum: 2014
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 1900
5th percentile: 1913
Q1: 1951
Median: 1976
Q3: 1997
95th percentile: 2009
Maximum: 2014
Range: 114
Interquartile range (IQR): 46

Descriptive statistics

Standard deviation: 29.731848
Coefficient of variation (CV): 0.015086287
Kurtosis: -0.6700759
Mean: 1970.7863
Median Absolute Deviation (MAD): 23
Skewness: -0.50215519
Sum: 9065617
Variance: 883.98281
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
2006 | 111 | 2.4%
2005 | 104 | 2.3%
2007 | 93 | 2.0%
2004 | 92 | 2.0%
1978 | 90 | 2.0%
2008 | 89 | 1.9%
2003 | 89 | 1.9%
1967 | 82 | 1.8%
1977 | 80 | 1.7%
2014 | 78 | 1.7%
Other values (105) | 3692 | 80.3%

Smallest values
Value | Count | Frequency (%)
1900 | 22 | 0.5%
1901 | 9 | 0.2%
1902 | 10 | 0.2%
1903 | 10 | 0.2%
1904 | 9 | 0.2%
1905 | 19 | 0.4%
1906 | 27 | 0.6%
1907 | 12 | 0.3%
1908 | 19 | 0.4%
1909 | 22 | 0.5%

Largest values
Value | Count | Frequency (%)
2014 | 78 | 1.7%
2013 | 57 | 1.2%
2012 | 33 | 0.7%
2011 | 24 | 0.5%
2010 | 28 | 0.6%
2009 | 50 | 1.1%
2008 | 89 | 1.9%
2007 | 93 | 2.0%
2006 | 111 | 2.4%
2005 | 104 | 2.3%

yr_renovated
Real number (ℝ)

Alerts: Zeros

Distinct: 60
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 808.60826
Minimum: 0
Maximum: 2014
Zeros: 2735
Zeros (%): 59.5%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 1999
95th percentile: 2011
Maximum: 2014
Range: 2014
Interquartile range (IQR): 1999

Descriptive statistics

Standard deviation: 979.41454
Coefficient of variation (CV): 1.2112349
Kurtosis: -1.8511109
Mean: 808.60826
Median Absolute Deviation (MAD): 0
Skewness: 0.3859187
Sum: 3719598
Variance: 959252.83
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0 | 2735 | 59.5%
2000 | 170 | 3.7%
2003 | 151 | 3.3%
2009 | 109 | 2.4%
2001 | 109 | 2.4%
2005 | 95 | 2.1%
2004 | 77 | 1.7%
2014 | 72 | 1.6%
2006 | 68 | 1.5%
2013 | 61 | 1.3%
Other values (50) | 953 | 20.7%

Smallest values
Value | Count | Frequency (%)
0 | 2735 | 59.5%
1912 | 33 | 0.7%
1913 | 1 | < 0.1%
1923 | 57 | 1.2%
1934 | 6 | 0.1%
1945 | 7 | 0.2%
1948 | 1 | < 0.1%
1953 | 1 | < 0.1%
1954 | 8 | 0.2%
1955 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
2014 | 72 | 1.6%
2013 | 61 | 1.3%
2012 | 45 | 1.0%
2011 | 54 | 1.2%
2010 | 30 | 0.7%
2009 | 109 | 2.4%
2008 | 45 | 1.0%
2007 | 7 | 0.2%
2006 | 68 | 1.5%
2005 | 95 | 2.1%
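
The mean of 808.6 is not a meaningful year, because 59.5% of the rows are 0. Assuming 0 is a placeholder for "no renovation recorded" (the report does not say so), the sketch below uses the Sum and Zeros figures from this section to estimate the mean over the remaining rows.

```python
# Figures copied from the yr_renovated section of this report.
n_rows = 4600
n_zeros = 2735
total = 3719598     # "Sum" of the column; zero rows contribute nothing to it

mean_all = total / n_rows             # what the report lists as Mean
n_nonzero = n_rows - n_zeros
mean_nonzero = total / n_nonzero      # mean over rows with a recorded year

print(f"mean over all rows:      {mean_all:.2f}")
print(f"non-zero rows:           {n_nonzero}")
print(f"mean over non-zero rows: {mean_nonzero:.2f}")
```

If the placeholder reading is right, treating the zeros as missing before modelling would avoid the bimodal distribution implied by a Q1 of 0 and a Q3 of 1999.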

Age
Real number (ℝ)

Alerts: High correlation

Distinct: 115
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 53.213696
Minimum: 10
Maximum: 124
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 10
5th percentile: 15
Q1: 27
Median: 48
Q3: 73
95th percentile: 111
Maximum: 124
Range: 114
Interquartile range (IQR): 46

Descriptive statistics

Standard deviation: 29.731848
Coefficient of variation (CV): 0.55872549
Kurtosis: -0.6700759
Mean: 53.213696
Median Absolute Deviation (MAD): 23
Skewness: 0.50215519
Sum: 244783
Variance: 883.98281
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
18 | 111 | 2.4%
19 | 104 | 2.3%
17 | 93 | 2.0%
20 | 92 | 2.0%
46 | 90 | 2.0%
16 | 89 | 1.9%
21 | 89 | 1.9%
57 | 82 | 1.8%
47 | 80 | 1.7%
10 | 78 | 1.7%
Other values (105) | 3692 | 80.3%

Smallest values
Value | Count | Frequency (%)
10 | 78 | 1.7%
11 | 57 | 1.2%
12 | 33 | 0.7%
13 | 24 | 0.5%
14 | 28 | 0.6%
15 | 50 | 1.1%
16 | 89 | 1.9%
17 | 93 | 2.0%
18 | 111 | 2.4%
19 | 104 | 2.3%

Largest values
Value | Count | Frequency (%)
124 | 22 | 0.5%
123 | 9 | 0.2%
122 | 10 | 0.2%
121 | 10 | 0.2%
120 | 9 | 0.2%
119 | 19 | 0.4%
118 | 27 | 0.6%
117 | 12 | 0.3%
116 | 19 | 0.4%
115 | 22 | 0.5%
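
Age has the same standard deviation, variance and kurtosis as yr_built, with the sign of the skewness flipped. That pattern is consistent with Age being computed as a fixed reference year minus yr_built. The sketch below tests that reading against the summary figures. The reference year of 2024 is inferred from the numbers and is not stated in the report.

```python
# Summary figures copied from the yr_built and Age sections of this report.
yr_built = {"min": 1900, "max": 2014, "mean": 1970.7863, "sum": 9065617}
age = {"min": 10, "max": 124, "mean": 53.213696, "sum": 244783}
n_rows = 4600
reference_year = 2024  # hypothetical: inferred from the figures, not given in the report

# If Age = reference_year - yr_built, the extremes swap and means and sums line up.
checks = {
    "min": reference_year - yr_built["max"] == age["min"],
    "max": reference_year - yr_built["min"] == age["max"],
    "mean": abs((reference_year - yr_built["mean"]) - age["mean"]) < 1e-3,
    "sum": reference_year * n_rows - yr_built["sum"] == age["sum"],
}
print(checks)
```

If the relationship holds row by row, the two columns carry identical information, so one of them could be dropped without loss.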

Total_sqft
Real number (ℝ)

Alerts: High correlation

Distinct: 659
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2451.4285
Minimum: 370
Maximum: 17670
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 36.1 KiB

Quantile statistics

Minimum: 370
5th percentile: 950
Q1: 1547.5
Median: 2260
Q3: 3042.5
95th percentile: 4610.5
Maximum: 17670
Range: 17300
Interquartile range (IQR): 1495

Descriptive statistics

Standard deviation: 1242.1942
Coefficient of variation (CV): 0.50672261
Kurtosis: 10.006532
Mean: 2451.4285
Median Absolute Deviation (MAD): 750
Skewness: 1.8799238
Sum: 11276571
Variance: 1543046.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1150 | 28 | 0.6%
1480 | 28 | 0.6%
1010 | 26 | 0.6%
1940 | 26 | 0.6%
2120 | 24 | 0.5%
2550 | 24 | 0.5%
1700 | 23 | 0.5%
2670 | 23 | 0.5%
1200 | 23 | 0.5%
1590 | 23 | 0.5%
Other values (649) | 4352 | 94.6%

Smallest values
Value | Count | Frequency (%)
370 | 1 | < 0.1%
380 | 1 | < 0.1%
420 | 1 | < 0.1%
430 | 1 | < 0.1%
490 | 1 | < 0.1%
520 | 1 | < 0.1%
550 | 1 | < 0.1%
560 | 1 | < 0.1%
580 | 1 | < 0.1%
590 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
17670 | 1 | < 0.1%
14460 | 1 | < 0.1%
12400 | 1 | < 0.1%
11220 | 1 | < 0.1%
9780 | 1 | < 0.1%
9040 | 1 | < 0.1%
8980 | 1 | < 0.1%
8630 | 1 | < 0.1%
8550 | 1 | < 0.1%
8330 | 1 | < 0.1%
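
The column sums in this report suggest how the square-footage variables relate to one another, which would explain several of the High correlation alerts. The sketch below checks two candidate identities using only the Sum figures listed above. Matching sums are necessary but not sufficient for the identities to hold row by row, so this is a plausibility check, not proof of how Total_sqft was built.

```python
# "Sum" figures copied from the corresponding variable sections of this report.
sums = {
    "sqft_living": 9840996,
    "sqft_above": 8405421,
    "sqft_basement": 1435575,
    "Total_sqft": 11276571,
}

# Candidate 1: sqft_living = sqft_above + sqft_basement
living_matches = sums["sqft_above"] + sums["sqft_basement"] == sums["sqft_living"]

# Candidate 2: Total_sqft = sqft_living + sqft_basement
total_matches = sums["sqft_living"] + sums["sqft_basement"] == sums["Total_sqft"]

print(living_matches, total_matches)
```

If both identities hold per row, Total_sqft counts the basement area twice, which may be worth confirming against the source data before using it as a feature.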

Interactions

[Interaction plots (Matplotlib v3.9.2 SVG images) are not preserved in this text export.]
2024-11-28T10:55:33.585953image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-28T10:55:35.639218image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-28T10:55:37.785295image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-28T10:55:40.288609image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
2024-11-28T10:55:42.340803image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Correlations

                  Age  Total_sqft  bathrooms  bedrooms  condition  floors   price  sqft_above  sqft_basement  sqft_living  sqft_lot    view  waterfront  yr_built  yr_renovated
Age             1.000      -0.191     -0.530    -0.160      0.265  -0.538  -0.084      -0.460          0.212       -0.322     0.012   0.053       0.027    -1.000         0.315
Total_sqft     -0.191       1.000      0.675     0.631      0.030   0.217   0.602       0.628          0.593        0.943     0.288   0.210       0.235     0.191        -0.085
bathrooms      -0.530       0.675      1.000     0.538      0.129   0.540   0.492       0.696          0.190        0.747     0.092   0.146       0.166     0.530        -0.213
bedrooms       -0.160       0.631      0.538     1.000      0.066   0.220   0.338       0.533          0.248        0.652     0.238   0.086       0.000     0.160        -0.056
condition       0.265       0.030      0.129     0.066      1.000   0.186   0.000       0.108          0.117        0.046     0.052   0.027       0.000     0.265         0.217
floors         -0.538       0.217      0.540     0.220      0.186   1.000   0.321       0.604         -0.288        0.397    -0.204   0.033       0.000     0.538        -0.229
price          -0.084       0.602      0.492     0.338      0.000   0.321   1.000       0.534          0.237        0.631     0.075   0.094       0.226     0.084        -0.071
sqft_above     -0.460       0.628      0.696     0.533      0.108   0.604   0.534       1.000         -0.172        0.843     0.305   0.102       0.134     0.460        -0.169
sqft_basement   0.212       0.593      0.190     0.248      0.117  -0.288   0.237      -0.172          1.000        0.323     0.023   0.195       0.211    -0.212         0.054
sqft_living    -0.322       0.943      0.747     0.652      0.046   0.397   0.631       0.843          0.323        1.000     0.325   0.173       0.269     0.322        -0.127
sqft_lot        0.012       0.288      0.092     0.238      0.052  -0.204   0.075       0.305          0.023        0.325     1.000   0.049       0.000    -0.012         0.051
view            0.053       0.210      0.146     0.086      0.027   0.033   0.094       0.102          0.195        0.173     0.049   1.000       0.483     0.055         0.050
waterfront      0.027       0.235      0.166     0.000      0.000   0.000   0.226       0.134          0.211        0.269     0.000   0.483       1.000     0.026         0.000
yr_built       -1.000       0.191      0.530     0.160      0.265   0.538   0.084       0.460         -0.212        0.322    -0.012   0.055       0.026     1.000        -0.315
yr_renovated    0.315      -0.085     -0.213    -0.056      0.217  -0.229  -0.071      -0.169          0.054       -0.127     0.051   0.050       0.000    -0.315         1.000
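
The Alerts section of this report flags variables as "highly overall correlated" with other fields. As a rough sketch of how such a list can be read off the matrix, the snippet below copies two rows from the correlation table and keeps the fields whose absolute coefficient is at least 0.5. The 0.5 cutoff and the helper name `highly_correlated` are assumptions made for illustration: the report does not state the cutoff, although this value happens to reproduce the alert counts for Age (3 fields) and Total_sqft (6 fields).

```python
# Column order as it appears in the correlation table.
COLUMNS = [
    "Age", "Total_sqft", "bathrooms", "bedrooms", "condition", "floors",
    "price", "sqft_above", "sqft_basement", "sqft_living", "sqft_lot",
    "view", "waterfront", "yr_built", "yr_renovated",
]

# Two rows copied verbatim from the correlation table.
ROWS = {
    "Age": [1.000, -0.191, -0.530, -0.160, 0.265, -0.538, -0.084, -0.460,
            0.212, -0.322, 0.012, 0.053, 0.027, -1.000, 0.315],
    "Total_sqft": [-0.191, 1.000, 0.675, 0.631, 0.030, 0.217, 0.602, 0.628,
                   0.593, 0.943, 0.288, 0.210, 0.235, 0.191, -0.085],
}

# Assumed cutoff; not stated anywhere in the report.
THRESHOLD = 0.5


def highly_correlated(name, values, threshold=THRESHOLD):
    """Return the other fields whose |coefficient| meets the threshold."""
    return [
        col
        for col, r in zip(COLUMNS, values)
        if col != name and abs(r) >= threshold
    ]


for name, values in ROWS.items():
    print(name, "->", highly_correlated(name, values))
```

Under this assumed cutoff, Age is linked to bathrooms, floors and yr_built, which matches the alert "Age is highly overall correlated with bathrooms and 2 other fields".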

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows

                     date      price  bedrooms  bathrooms  sqft_living  sqft_lot  floors  waterfront  view  condition  sqft_above  sqft_basement  yr_built  yr_renovated  Age  Total_sqft
   0  2014-05-02 00:00:00   313000.0       3.0       1.50         1340      7912     1.5           0     0          3        1340              0      1955          2005   69        1340
   1  2014-05-02 00:00:00  2384000.0       5.0       2.50         3650      9050     2.0           0     4          5        3370            280      1921             0  103        3930
   2  2014-05-02 00:00:00   342000.0       3.0       2.00         1930     11947     1.0           0     0          4        1930              0      1966             0   58        1930
   3  2014-05-02 00:00:00   420000.0       3.0       2.25         2000      8030     1.0           0     0          4        1000           1000      1963             0   61        3000
   4  2014-05-02 00:00:00   550000.0       4.0       2.50         1940     10500     1.0           0     0          4        1140            800      1976          1992   48        2740
   5  2014-05-02 00:00:00   490000.0       2.0       1.00          880      6380     1.0           0     0          3         880              0      1938          1994   86         880
   6  2014-05-02 00:00:00   335000.0       2.0       2.00         1350      2560     1.0           0     0          3        1350              0      1976             0   48        1350
   7  2014-05-02 00:00:00   482000.0       4.0       2.50         2710     35868     2.0           0     0          3        2710              0      1989             0   35        2710
   8  2014-05-02 00:00:00   452500.0       3.0       2.50         2430     88426     1.0           0     0          4        1570            860      1985             0   39        3290
   9  2014-05-02 00:00:00   640000.0       4.0       2.00         1520      6200     1.5           0     0          3        1520              0      1945          2010   79        1520

Last rows

                     date          price  bedrooms  bathrooms  sqft_living  sqft_lot  floors  waterfront  view  condition  sqft_above  sqft_basement  yr_built  yr_renovated  Age  Total_sqft
4590  2014-07-08 00:00:00  380680.555556       4.0       2.50         2620      8331     2.0           0     0          3        2620              0      1991             0   33        2620
4591  2014-07-08 00:00:00  396166.666667       3.0       1.75         1880      5752     1.0           0     0          4         940            940      1945             0   79        2820
4592  2014-07-08 00:00:00  252980.000000       4.0       2.50         2530      8169     2.0           0     0          3        2530              0      1993             0   31        2530
4593  2014-07-08 00:00:00  289373.307692       3.0       2.50         2538      4600     2.0           0     0          3        2538              0      2013          1923   11        2538
4594  2014-07-09 00:00:00  210614.285714       3.0       2.50         1610      7223     2.0           0     0          3        1610              0      1994             0   30        1610
4595  2014-07-09 00:00:00  308166.666667       3.0       1.75         1510      6360     1.0           0     0          4        1510              0      1954          1979   70        1510
4596  2014-07-09 00:00:00  534333.333333       3.0       2.50         1460      7573     2.0           0     0          3        1460              0      1983          2009   41        1460
4597  2014-07-09 00:00:00  416904.166667       3.0       2.50         3010      7014     2.0           0     0          3        3010              0      2009             0   15        3010
4598  2014-07-10 00:00:00  203400.000000       4.0       2.00         2090      6630     1.0           0     0          3        1070           1020      1974             0   50        3110
4599  2014-07-10 00:00:00  220600.000000       3.0       2.50         1490      8102     2.0           0     0          4        1490              0      1990             0   34        1490
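
The sample rows suggest that Age and Total_sqft are derived columns, which would also explain the -1.000 correlation between Age and yr_built. The sketch below checks that reading against a handful of the rows shown in the sample. Both formulas (Age = 2024 - yr_built, Total_sqft = sqft_living + sqft_basement) and the reference year are inferred from the data rather than stated in the report, and `check_row` is a hypothetical helper.

```python
# Assumed reference year, inferred from the sample rows (e.g. 2024 - 1955 = 69).
REFERENCE_YEAR = 2024

# (yr_built, Age, sqft_living, sqft_basement, Total_sqft), copied from rows
# 0, 1, 3, 4 and 4598 of the sample.
SAMPLE = [
    (1955, 69, 1340, 0, 1340),
    (1921, 103, 3650, 280, 3930),
    (1963, 61, 2000, 1000, 3000),
    (1976, 48, 1940, 800, 2740),
    (1974, 50, 2090, 1020, 3110),
]


def check_row(yr_built, age, sqft_living, sqft_basement, total_sqft):
    """True if both assumed derivations hold for this row."""
    age_ok = REFERENCE_YEAR - yr_built == age
    total_ok = sqft_living + sqft_basement == total_sqft
    return age_ok and total_ok


results = [check_row(*row) for row in SAMPLE]
print(all(results))  # True for the rows listed here
```

If the derivations hold for the full dataset, Age carries no information beyond yr_built, so one of the two could be dropped before modelling.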